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Abstract: G protein-coupled receptor (GPCR) structures are of interest as a means to 
understand biological signal transduction and as tools for therapeutic discovery. The 
growing number of GPCR crystal structures demonstrates that the extracellular loops (EL) 
connecting the membrane-spanning helices show tremendous structural variability relative 
to the more structurally-conserved seven transmembrane a-helical domains. The EL of the 
LPAi receptor have not yet been conclusively resolved, and bear limited sequence identity 
to known structures. This study involved development of a peptide to characterize the 
intrinsic structure of the LPAi GPCR second EL. The loop was embedded between two 
helices that assemble into a coiled-coil, which served as a receptor-mimetic folding 
constraint (LPAi-CC-EL2 peptide). The ensemble of structures from multi-dimensional 
NMR experiments demonstrated that a robust coiled-coil formed without noticeable 
deformation due to the EL2 sequence. In contrast, the EL2 sequence showed well-defmed 
structure only near its C- terminal residues. The NMR ensemble was combined with a 
computational model of the LPAi receptor that had previously been validated. The 
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resulting hybrid models were evaluated using docking. Nine different hybrid models 
interacted with LP A 18:1 as expected, based on prior mutagenesis studies, and one was 
additionally consistent with antagonist affinity trends. 

Keywords: G protein-coupled receptor; GPCR; lysophosphatidic acid; LP A; NMR; 
GPCR segment model 



1. Introduction 

G protein-coupled receptors (GPCRs) are a large family of integral membrane proteins that play 
critical roles in cellular signaling. Members of this family are common drug targets, estimated to be the 
target of up to 50% of drugs [1,2]. Family members not currently targeted by existing drugs are the 
subject of extensive drug discovery research programs. The lysophosphatidic acid (LP A) receptors are 
among this latter group of GPCR due to their roles in cellular survival, migration, proliferation, and 
apoptosis [3-5]. These LPAi receptor-mediated actions have been linked to the survival and metastasis 
of several cancer cell types [6-8]. This therapeutic relevance of LPAi stimulated our development and 
validation of an LPAi receptor model to guide the identification of LPAi receptor antagonists [9-13]. 

Substantial interest in the structures of GPCR to accelerate drug candidate discovery has so far led 
to over a dozen different nearly-complete class A (rhodopsin-like) GPCR with published 
crystallographic [14—27] or solid-state NMR [28,29] structures (some in multiple states or with a 
variety of ligands, rhodopsin alone is represented in approximately 20 different PDB entries as 
reviewed in [30]), and numerous structures of the iV-terminal domain of class B (secretin family) and 
C (metabotropic glutamate/pheromone) GPCR family members (reviewed in [30]). Figure 1 
demonstrates that representative crystallized class A GPCR structures show strong structural 
conservation fi-om the intracellular end of the helical bundle up toward the last third of the extracellular 
end of the helical bundle. Extensive structural diversity is displayed among the loops exposed to the 
extracellular space, particularly the longer second (EL2, highlighted in Figure 1) and third (EL3) 
extracellular loops. Among the class A GPCR family members shown, only three EL2 architectures 
are repeated and six additional architectures are each present in only one family member. One repeated 
architecture is exhibited by the pi and P2 adrenoceptors. A second is common to the M2 and M3 
muscarinic acetylcholine receptors. The third is shared by the CXCR4 chemokine receptor and the 
opioid receptors (two of four opioid receptors shown in Figure 1). Overall sequence identity between 
the receptors with common structure is 55% or higher. This superposition indicates that comparative 
models of GPCR, such as our model of LPAi, will be highly accurate at the intracellular half of the 
helical bundle, moderately accurate at the extracellular half of the helical bundle, and poorly accurate 
in the loop segments connecting the transmembrane segments, unless a close homolog exhibiting a 
shared architecture is known. Experimental characterization of the highly variable connecting loop 
segments is, therefore, of value to improve the quality of comparative models and provide structural 
insights into the source of ligand selectivity. 
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Figure 1. Superposition of twelve crystallized GPCR family members. Rhodopsin 
(1F88 [21], orange), p2-adrenoceptor (2RH1 [19], magenta), pi -adrenoceptor (2VT4 [18], 
cyan), adenosine A2a (3EML [17], pink), dopamine D3 (3PBL [15], brown), chemokine 
CXCR4 (30eO [16], green), histamine HI (3RZE [14], mustard), muscarinic acetylcholine 
M2 (3U0N [22], brick red), muscarinic acetylcholine M3 (4DAJ [33], grey), K-opioid 
(4DJH [23], red), |a-opioid (4DKL [24], purple), and SlPi (3V2Y [25], blue) are shown 
using ribbon representations. Left: Complete backbone rendered as ribbon with the exception 
of T4 lysozyme replacements for IL3. The four TM segments closest to the viewer are 
labeled. Right: Two views of EL2 and extracellular ends of surrounding helices with 
structures separated into two groups for clarity. 




Numerous structural characterization studies on sequence fragments from GPCR have been 
performed (reviewed in [30]). These studies indicate that carefully engineered designs including 
disulfide bonds or self-assembling coiled-coils can be used to promote folded structures, in contrast to 
isolated loop fragment sequences which are generally relatively extended and flexible. We previously 
reported the structure of the SIP4 first extracellular loop (SIP4-ELI) using an antiparallel coiled-coil 
and disulfide bond to provide a folding constraint [31]. This design required 20% trifluorethanol to 
promote helicity due to the relatively short coiled-coil sequences used. Nevertheless, this prior design 
led to the identification of a short 3^" helical segment in the SIP4 first extracellular loop, and 
demonstrated the ability of the peptide to selectively recognize a truncated version of the natural 
agonist (phosphoethanolamine) but not the truncated version of a related non-agonist lipid 
(A^-acylethanolamine). The relevance of this structural feature in the full-length receptor is supported 
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by the recent crystallographic structure of the closely-related SlPi receptor, which shows a short helix 
in the corresponding location. In the current study, we report on the structure of the substantially 
longer LPAi second extracellular loop sequence in aqueous buffer using a longer optimized 
antiparallel coiled-coil [32] to promote self-assembly in the absence of both trifluoroethanol and an 
interhelical disulfide bond. This improved second-generation design was applied to characterize the 
intrinsic structure of the LPAi second extracellular loop sequence in aqueous solution, demonstrating 
that this loop contains no intrinsic secondary structure elements, but does exhibit a strong outward 
bend at the C-terminal end leading into the C-terminal helix. 

2. Results and Discussion 

2.1. Peptide Design 

In order to investigate loop conformations, a coiled-coil scaffold was constructed to provide a 
folding constraint for the particular loop of interest. Oakley et al. have previously studied designed 
peptide sequences for their ability form parallel and anti-parallel coiled-coils [32]. The ability of 
peptide pairs to form a coiled-coil was confirmed using CD. The placement of two Asn residues was 
important in distinguishing a parallel versus an anti-parallel coiled-coil. In this study we used their 
sequences along with EL2 of LPAi in a continuous sequence (LPAi-CC-EL2) which formed an 
anti-parallel coiled-coil due to the placement of two Asn residues in positions a' and d (Figure 2). EL2 
was chosen for this study since it has a wide variety of structures across the different GPCR (Figure 1). 



Figure 2. Sequence of LPAi-CC-EL2 shown with helical wheels for the coiled-coil segment. 
Nl 1 and N71 have been highlighted to show that they should occur in cross positions. 
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2.2. Circular Dichroism (CD) 



Circular dichroism spectra were collected in order to define an appropriate buffer for NMR studies, 
in which the LPAi-CC-EL2 peptide exhibited the predominantly helical structure expected on the basis 
of the design. Once an appropriate buffer was selected, thermal denaturation was performed using 
220 nm to follow denaturation of the a-helical secondary structure. Figure 4 shows a wavelength scan 
at 20 °C as well as thermal denaturation. Deconvolution of the secondary structure using the CDSSTR 
algorithm with reference set 7 [34] on Dichroweb [35-37] indicates the LPAi-CC-EL2 peptide is 65% 
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helical, 11% strand, 9% turns, and 14% disordered. Thermal denaturation data demonstrated that 
LPAi-CC-EL2 unfolding was not completely cooperative, as a plateau was apparent at about 90 °C. 
The CD thermal denaturation of the LPAi-CC-EL2 peptide (Figure 3) is consistent with the thermal 
denaturation studies performed by Oakley et al. [32] on the separate coiled-coil sequences as an equimolar 
mixture, which showed a single thermal denaturation below 80 °C (the highest temperature tested). 

Figure 3. LPAi-CC-EL2 CD spectra as a function of (A) wavelength and (B) temperature. 
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2.3. Chemical Shift Assignments and Helical Calculations 

Sequence-specific chemical shift assignments for LPAi-CC-EL2 were obtained from the interactive 
use of the backbone 3D NMR experiments listed in experimental methods. These chemical shifts were 
then used to obtain the sidechain assignments (aromatics unassigned). Chemical shifts have been 
deposited in the BioMagResBank under accession number 17993. The backbone nitrogen of E26 has 
gone unassigned; however, its sidechain chemical shifts were obtained using back correlations from 
K27. The TALOS program provided 1 19 O and ^ dihedral angles from the chemical shift data. A total 
of 715 ^H-^H distance resfraints (NOEs) from the 3D ^^N-NOESY-HSQC and '^C-NOESY-HSQC 
experiments were employed in the CNS structure calculation. Four NOE peak intensity categories were 
used to classify infra 'H-^H upper distance limits; sfrong (<2.7 A), medium (<3.3 A), weak (<4.0 A), and 
very weak (<4.5 A). These two types of consfraints were used in the initial CNS structure calculations. 
Once a calculation was completed, the NOEs and dihedrals were assessed based on violations. If a 
distance or dihedral was violated, it was redefined to the next weakest category for NOEs or 5° was 
added to the dihedral range. This continued until there were no NOE violations greater than 0.5 A and 
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no dihedral violated by greater than 10°. At this point another calculation was performed that defined 
24 hydrogen bonds down the two helices to further stabilize their structures. 

2.4. Coiled Coil 

Initial structure calculations employed short and medium range NOEs to refme the two separate 
helical structures. Both TALOS and chemical shift analysis (data not shown) indicated helical structure 
from residues 3-31 and 50-78. Once these structures converged, the coiled-coil was initially defined 
by 10 distances observed in the '^C-NOESY-HSQC data. Specifically, very weak distances were 
defined between the methyl groups of L4 to L74, LI 8 to L60, L22 to L64, L25 to L53 and L29 to L57 
(two distances for each pair). These distances represented the only unique chemical shift pairs in the 
predicted coiled-coil since most of the sidechain leucine interactions were overlapped. However, this 
proved to be sufficient to define the coiled-coil structure. One key feature in the design of this 
coiled-coil was the presence of Nil and N71. The positions of these residues are important to promote 
an anti-parallel coiled-coil. There were two NOEs observed in the ^^N-NOESY-HSQC, one from the 
NH2 group of Nl 1 to the methyl of L74 and the other fi-om the NH2 group of Nl 1 to the NH2 group of 
N71. First, a calculation was performed that defined the distance for Nil to L74. In all the resulting 
structures from previous coiled-coil calculations the Nl I and N7I residues were at positions a' and d 
which indicated an anti-parallel coiled-coil. Also the sidechain NH2 group of Nil pointed to the 
sidechain O of N71 which still placed the two NH2 groups close enough to show a weak NOE, 
therefore another calculation contained this distance restraint. 

2.5. Loop 

The last few rounds of calculations employed short and medium range distance restraints in the 
loop. Most of the loop residues did not result in any NOEs in either the ^^N-NOESY-HSQC or 

1 o 

C-NOESY-HSQC suggesting a complete lack of structure in these residues of the loop. However, 
there were short range (/ to / and / to / ± 1) ^^N-NOEs observed from residues Ala45 through Ala52. 
There were also three medium range ^^N-NOEs involving Pro46 through Ala52 (Figure 4). The last 
calculation included these NOEs which produced an interesting conformation in this part of the loop 
(see Section 2.6). The final round of structure calculations consisted of 100 structures. Table 1 shows 
statistical information used for and resulting from the final calculation of the 58 lowest energy 
structures out of the 100 used in the final round of structure calculations. Of these 58 lowest energy 
structures, 36 representatives for the LPAi-CC-EL2 were superimposed as shown in Figure 5. 
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Figure 4. ^^N-NOE contacts of Y48 with P46, L47, Y48, and A52 that define the outward 
bend at the C-terminal end of LPAi EL2. 

Y48 




Table 1. Statistics from the CNS structure family of LPAi-CC-EL2. 



Distance Restraints 

No. of total restraints 858 

No. of intraresidue (/ =j) 274 

No. of interresidue {{i -j) = 1, (f +y) = 1) 279 

No. of medium (1< (f -j) < 5) 150 

No. oflong((/-7)>5) 12 

No. of dihedral-angle constraints (O and *P) 119 

No. of hydrogen bonds 24 

Etotai (kcal/mol) 3 149.26 ± 32. 19 
RMSDs from experimental restraints (CNS defaults) 

Bonds (A) 0.0079 ± 0.0002 

Angles (°) 1.141 ±0.021 

Impropers (°) 1 .099 ± 0.032 

NOEs 0.1 19 ±0.002 

Dihedrals 0.677 ±0.139 

Coordinate superimpose 

Backbone RMSD (A) (2-79) 4.01 

Backbone RMSD (A) (3-30) 0.63 

Backbone RMSD (A) (53-78) 0.58 

Backbone RMSD (A) (3-30 and 53-78) 0.95 
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Figure 5. Superimposition of 36 representative low energy structures from the 
CNS calculation. Left: A^-terminal helix on the left and C-terminal helix on the right, 
Center: A^-terminal helix on the right and C-terminal helix on the left, and Right: side view 
with the C-terminal helix in front. A single representative structure has also been shown for 
a simplified view. 



2.6. Overall Structure 

The overall structure of LPAi-CC-EL2 does not converge to a single family (backbone RMSD 4.01 A) 
due to a lack of NOEs in the loop. However, each helix does show convergence (backbone RMSD 
0.63 A and 0.58 A respectively) as does the coiled-coil (backbone RMSD 0.95 A). The RMSD values 
for the bonds, angles, impropers, NOEs, and dihedrals are acceptably low (<3% of values for bonds, 
angles, impropers and NOEs, -20% of value on dihedrals). Overall, >80% of the residues of the 
LPAi-CC-EL2 family fall in favored or allowed regions of the Ramachandran plot indicating a well 
calculated structure. The set of 36 structures in Figure 5 were submitted to the Research CoUaboratory 
for Structural Bioinformatics (RCSB) Protein Data Bank (PDB) with the RCSB ID code RCSB 102692 
and PDB ID code 2LQ4. 

The results of these calculations as well as the CD wavelength spectra and thermal denaturation 
show that the peptide sequences from Oakley et al. [32] do indeed form a coiled-coil when connected 
by a linking peptide. These sequences, therefore, provide an exceUent scaffold within which loop 
conformations can be investigated. These results show that the EL2 conformation of the LPAi receptor 
does not contain any well-defined secondary structure, analogous to EL2 in six of the class A GPCR 
members for which crystal structures are available (Figure 1). However, there was an interesting result 
in the conformation around residues P46 to A52, due to NOE contacts from the sidechains of residues 
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P46 (P-hydrogen), Y48 (5- and s- hydrogens), and A52 (P-hydrogen) to the backbone amide hydrogens 
of residues L47, Y48, and D50 (Figure 4). Although the sidechain of coiled-coil residue A52 is 
involved in this network of contacts, the P-hydrogens of the tyrosine residue that occurs in full-length 
LPAi could make the same contacts, thus this conformation appears to be an intrinsic property of the 
LPAi EL2 loop sequence. Due to these NOE contacts, this part of the loop was always in a bent 
conformation (Figure 5); however it did not converge into a single conformation suggesting that this 
part of the loop is limited to a small amount of conformational space while the rest of the loop has a 
greater range of conformational freedom. This would represent the carboxy-terminal end of the loop 
transitioning into helix 5 of the LPAi receptor. The resulting structures from these calculations were 
superimposed into the corresponding helix4-EL2-helix5 of our LPAi receptor model in order to 
illustrate how this bend in EL2 fit onto the overall structure (Figure 6) and to provide starting 
structures for hybrid LPAi receptor model development. Of these 58 structures, only 2 showed any 
overlap of the loop with atoms of the receptor. The bend in the loop places the loop in an outward 
conformation that leaves the top of the receptor open, potentially to accommodate a structured segment 
within the A^-terminus as observed in the recently characterized SlPi receptor [25]. The outward bend 
of EL2 is also observed in the recently characterized M2 muscarinic acetylcholine receptor although at 
the amino-terminal end of EL2 rather than the carboxy-terminal end (Figure 7). 

Figure 6. Superimposition of the calculated structures on helix 4 and 5 of the validated 
LPAi receptor model. The LPAi receptor model is shown as a magenta ribbon with TMl 
on the left side, the A^-terminal ends of the LPAi-CC-EL2 peptide structures are shown as 
red ribbons (superposed on TM4 of the LPAi model), the C-terminal ends as blue ribbons 
(superposed on TM5 of the LPAi model), and segments superposed on the LPAi receptor 
model are shown as yellow ribbons. 




2. 7. Hybrid LPA i Receptor Models 

The 58 lowest-energy LPAi-CC-EL2 NMR structures were each superposed on the validated LPAi 
receptor model in order to produce a hybrid NMR/model structiare by multi-template homology 
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modeling (Figure 6). The ELI portion of the LPAi receptor model had previously been remodeled on 
the basis of the NMR structure of our shorter coiled-coil ELI mimetic of the SIP4 receptor [31], which 
exhibits a short 3'" helical turn at the same location as a short helix noted in the recent SlPi crystal 
structure. Thus the 58 resulting models are hybrids of the NMR structure data for LPAi-CC-EL2, 
SIP4-ELI, and the original template LPAi receptor model. One of the resulting hybrid LPAi receptor 
model structures showed EL2 wrapped around EL3 and was eliminated from further consideration. 
Seventeen additional structures showed very close contacts (<1.6 A) between atoms in the EL2 

3 28 3 29 

structure and residues R124 or Q125 at the extracellular end of TM3 (Figure 8), which have been 
demonstrated by previous modeling and mutagenesis to form required interactions with the natural 
agonist, LPA. These close interactions would prevent LPA interaction at these sites, so these 17 models 
were also eliminated from fiarther consideration. The remaining 40 models are shown in Figure 8. 

Figure 7. Comparison of outward bends in LPAi-CC-EL2 NMR structures (A^-terminal 
helix: red, C-terminal helix: blue, EL2: green) and EL2 from the M2 muscarinic acetylcholine 
receptor crystal structure (3U0N [22], brick red). 



2.8. Docking Results 

Docking studies were performed for the ligands shown in Figure 9 using each of the 40 hybrid 
LPAi receptor models from Figure 8 as docking targets. These studies were performed in order to 
determine if specific members of the NMR ensemble were compatible with known features of the 

3 28 3 29 

fiill-length LPAi receptor. LPA 18:1 is expected to interact with residues R124 and Q125 at the 
extracellular end of TM3, as mutations of either of these residues to alanine produces receptors that are 
either only weakly activated or are not activated by LPA 18:1 [12]. The corresponding sites in the 

3 28 3 29 

closely-related SI Pi receptor, R120 and E121 , have been demonstrated to interact with the 
phosphate and ammonium groups of a co-crystallized antagonist [25]. Thirty-one models were 
eliminated from further consideration due to high-ranking poses of LPA with phosphate groups placed 

3 28 3 29 

in the lower end of the binding pocket, distant from both R124 and Q125 . Such a result suggests 
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that hybridization of that particular loop geometry has altered the headgroup recognition pocket, and 
therefore the resulting hybrid does not represent a biologically meaningful model of the full-length 
receptor. The remaining nine models in which the top-ranked LPA pose exhibited a headgroup position 

3 28 3 29 

near R124""° and Q 125" "^ are shown in Figure 10. Figure 11 illustrates the consistency of complexes 
docked into the hybrid model generated from chain 3 in PDB entry 2LQ4 with the known antagonist 
pharmacology. The active antagonists, while all having a buried carboxylate functional group, have 
that polar functional group in a relatively polar subpocket formed by S3. 39, S7.46, and N7.45 (panel B). 
In contrast, the inactive compound, NSC 47091, buries two carboxylic acid functional groups in 
relatively hydrophobic pockets, including S5.37 as the only polar sidechain (panel C). 

Figure 8. Hybrid NMR/Model LPAi Receptor Structures. Receptor backbone from the 
previously validated LPAi receptor model is rendered as a ribbon colored from blue at the 
amino terminus to red at the carboxy terminus. EL2 segment structures modeled based on 
the NMR structures of LPAi-CC-EL2 are colored green. R124^^^ and QUS^-^"^ from one 
model are shown as sticks for clarity (all-atom RMSD for atoms in these amino acids in the 
40 models is 0.9 A). 




2.9. Full-length Sequence Context 

The present study demonstrates that the sequence found in the LPAi EL2 is largely flexible and 
unstructured, even when the termini are held close together by a coiled-coil scaffold. Two 
characteristics that might reduce the conformational space available to this loop in the frill-length 
receptor are disulfide bonds and contacts with other extracellular segments including ELI, EL3, or the 
amino terminus. The recently crystallized SlPi structure has an EL2 identical in length which exhibits 
48% residue identity and 57% homology to the LPAi EL2, and is the only class A GPCR crystallized 
to date that lacks an interloop disulfide bond between cysteine residues in ELI and EL2. SlPi instead 
exhibits an intraloop disulfide bond between two cysteine residues in EL2 and a second intraloop 
disulfide bond between two cysteine residues in EL3. A superposition of the NMR ensemble on the 
SlPi EL2 is shown in Figure 12. Like SlPi, the LPAi receptor lacks cysteine in ELI, and exhibits 
conservation of the cysteine residues involved in the internal EL2 disulfide bond (CI 84 and C191 in 
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LPAi) [38]. However, LPAi additionally contains a third cysteine residue in both EL2 (CI 86) and EL3 
that is not found in SlPi. The additional cysteine residues prevent direct inference of matching 
disulfide bonds in SlPi and LPAi. However, it is likely that one or more of the three LPAi EL2 
cysteine residues participates in intraloop or interloop disulfide bonds in the context of the fiiU-length 
receptor that would provide an additional conformational restriction relative to the LPAi-CC-EL2 
peptide characterized with cysteine residues in reduced form. However, it is not clear at this time if the 
LPAi EL2 structure in the context of the full-length LPAi receptor should closely match that of SlPi 
due to the differences in number of cysteine residues noted. The NMR ensemble of LPA1-CC-EL2 
showed no consistent Cys-Cys close contacts that suggest an intrinsic disulfide pairing preference. 



Figure 9. Chemical structures used in docking studies, 
o o 




Figure 10. Hybrid LPAi rnodel complexes with LPA (stick) headgroup placed near 

3 28 3 29 

R124 and Q125 (ball & stick). EL2 segment shown as green ribbons, remainder of 
LPAI colored from blue at the amino terminus to red at the carboxy terminus. 
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Figure 11. Consistency of LPAi hybrid model generated from chain 3 of 2LQ4 with 
pharmacological trends. LPAi is shown as ribbons shading from blue at the amino terminus 
to red at the carboxy terminus in all panels. R3.28 and Q3.29 are shown as ball&stick 
models in all panels. Antagonists are colored according to potency, with the most potent 
antagonist H2L5765834 colored green, followed by H2L5 105099 (blue-green), 
H2L7724589 (yellow), H2L5226501 (orange), and the inactive compound NSC47091 
(red). LPA is shown using element colors in all panels. (A) Superposition of all docked 
hgands; (B) Superposition of three most active antagonists showing residues within 4.5 A 
of the buried carboxylate groups as either lines (hydrophobic sidechains) or labeled sticks 
(polar sidechains); (C) Docked pose of inactive compound showing residues within 4.5 A 
of the buried carboxylate groups as either lines (hydrophobic sidechains) or labeled sticks 
(polar sidechains). 
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Figure 12, Comparison of LPAi-CC-EL2 ensemble (A^-terminal helix: red, C-terminal 
helix: blue, EL2: green) to the SlPi EL2 (3V2Y [25], yellow). The disulfide bonded 
cysteine residues in the SlPi EL2 are shown as spacefilling models. 




3. Experimental Section 

3.1. Sample Preparation 

The second extracellular loop of LPAi was inserted between helical segments known to form an 
antiparallel coiled-coil to form a synthetic protein sequence, termed LPAi-CC-EL2. LPAi-CC-EL2 
DNA was prepared by PCR using 7 overlapping oligonucleotides and two terminal primers, and 
incorporated into pET-32 Ek/LIC (EMD Millipore) using ligation-independent cloning to produce the 
plasmid containing a hexahistidine tag. The resulting expression vector was confirmed with DNA 
sequencing (Molecular Resource Center, UTHSC, Memphis, TN) and then transformed into E. coli 
BL21(DE3) cells for protein expression. Cell cultures were grown in minimal media containing 1 L of 
H2O, 2.0 g/L of ^^Ce-glucose as the sole source of "C and 1.0 g/L of '^NH4C1 as the sole source 
This growth was performed as previously described, with no alterations [31]. Cells were grown at 37 °C 
and protein expression induced with 0.1 mM IPTG at an ODeoo of 0.6. The cells were pelleted by 
centrifugation and stored at -80°C. The cell pellet was lysed using the CelLytic B Plus Kit 
( Sigma- Aldrich, MI, USA) and loaded directly onto a Ni-NTA affinity column (GE Healthcare, 
New Jersey, NJ, USA), washed with 40 mM imidazole in 20 mM sodium phosphate buffer, pH 7.4, 
containing 500 mM sodium chloride followed by elution with 500 mM imidazole in the same buffer. 
The desired peptide (LPAi-CC-EL2) was then cleaved from the His-tag using recombinant 
enterokinase (EMD Biosciences) and purified by HPLC using a reverse-phase CI 8 preparative 
column. Anion-exchange chromatography was then used to separate the desired protein from 
nonspecific cleavage products, followed by confirmation using ESI-mass spectroscopy and sodium 
dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The final protein sample was 
concentrated to either 0.1 mM (for CD thermal denaturation) or 0.5 mM (for NMR studies), and the 
solvent exchanged to 50 mM Tris-HCl, 1.0 mM DTT, 1 mM sodium azide, pH 6.0 and 10% D2O. 
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3.2. Circular Dichroism (CD) Data 

All CD data were acquired on an Aviv 410 spectropolarimeter using a 1 mm cell. Wavelength scans 
were collected for 0.1 mg/mL samples at 1 nm increments over a wavelength range of 190-260 nm at 
room temperature. Thermal denaturation experiments were performed over a temperature range from 
20 °C to 110 °C at a wavelength of 220 nm. Ellipticities (c])) measured during thermal denaturation 
were converted to % unfolded at each temperature (7) based on the assumption that the structure was 
fully folded at the lowest temperature (rmin) and fully unfolded at the highest temperature (Tmax) 
using Equation 1 [39]: 



3.3. NMRData 

All NMR data were acquired at 25 °C on a Varian VNMRS 500 MHz spectrometer using a 5 mm 
triple-resonance 'H, ^^C, '^N probe with z-axis gradients. The sequential backbone assignments 
were accomplished through the interpretation of the experiments: HNCA, HN(CO)CA, HNCACB, 
CBCANH, HNCO, HN(CA)CO and ^^N-NOESY-HSQC. Sidechain assignments were obtained 
through the H(CCO)NH-TOCSY, (H)C(CO)NH-TOCSY and HCCH-TOCSY experiments. NOE data 
was interpreted from the ^^N-NOESY-HSQC and ^^C-NOESY-HSQC experiments, both with a 150 ms 
mixing time. Data was processed using NMRPipe [40] software and analyzed with PIPP [41]. 

3.4. Structure Calculations 

NMR chemical shift values for ^^N, ^H", ^^C", ^^C^, and ^^C were used in the TALOS program [42] for 
predictions of O and ^ dihedral angles. NOE distances were obtained from the 3D ^^-NOESY-HSQC 
and C-NOESY-HSQC experiments. Dihedral angle and NOE restraints were used as input for the 
structure calculation with the crystallography and NMR system (CNS version 1.1) program [43]. A 
series of structure calculations ended when a family of structures was obtained in which no NOE 
distance was violated by >0.5 A, no dihedral angle was violated by >10°, and low energies and good 
convergence were obtained. 

3.5. Hybrid Receptor Model Development 

Hybrid LPAi receptor models were developed using a previously validated model of the LPAi 
receptor [9,12,44] and the NMR structures of LPAi-CC-EL2 as input for multi-template homology 
modeling by the MOE software [45]. The hybrid models were generated using residues Leu29-Gln51 
from the NMR structures of LPAi-CC-EL2 as the basis for modeling residues Serl83-Ser205 and the 
previously published LPAi receptor model as the basis for modeling the remainder of the structure. 
The resulting models did not include residues 1-35 and 332-364 {N- and C-termini). 




(1) 
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3.6. Docking 

LP A 18:1 and several compounds exhibiting varying potencies as LP A receptor antagonists 
(H2L5765834 Ki = 48 nM; H2L5105099 Ki = 50 nM; H2L7724589 Ki = 311 nM; H2L5226501 
maximal inhibition of 59% at 30 |a,M and NSC47091 inactive, structures shown in Figure 9) [46,47] 
were docked into each hybrid LPAi receptor model using Autodock Vina. The docking box was centered 
between the sidechains of W271^''*^ and 8304^ '^^ and extended 40 A along the long axis of the model and 
22 A along each of the other two axes. Top-ranked poses obtained using searches with exhaustiveness set 
to 16 were analyzed for consistency with known relative potencies and mutagenesis results. 

4. Conclusions 

In conclusion, this study has demonstrated that the two peptide sequences designed by Oakley et ah, 
to form an antiparallel coiled-coil dimer in an aqueous solution, can also self-assemble within a single 
peptide sequence that included a long linker of 19 amino acids from the LPAi EL2. The coiled-coil 
motif provided a receptor-mimetic folding constraint for the longest LPAi extracellular loop. This 
folding constraint made possible atomic-resolution structural studies of the LPAi EL2 in solution, 
permitting any intrinsic flexibility of the receptor loop sequence to be reflected in the structural data. 
Inclusion of the coiled-coil is a feasible alternative to performing NMR structural studies of full-length 
GPCR proteins embedded in membranes or detergents, although with the limitation that structural 
features dependent on interactions with other loops will not be observed. The NMR data collected for 
the LPAi-CC-EL2 sequence produced a well-converged coiled-coil structure for these designed 
segments of the overall peptide sequence. In contrast, the native LPAi EL2 loop sequence positioned 
between these coiled-coil sequences showed convergence only at its C-terminal end, which exhibited 
an outward bend relative to the loop's position within the full-length LPAi receptor. This lack of 
structural convergence for the remainder of the loop sequence suggests high intrinsic flexibility under 
reducing conditions. The majority of structures that were produced during the model generation and 
refinement process were used in generating hybrid full-length LPAi models without excessive 
disruption of validated portions of the original model. However, only nine of these hybrid models 
produced top-ranking docking poses for LP A 18:1 that were consistent with mutagenesis studies 
demonstrating that R120 and E121 are essential for LPA 18:1 activity. Figure 13 compares top-down 
views of a model inconsistent with mutagenesis and that shown in Figure 11. This top view 
demonstrates that residues in EL2 can influence the relative orientation of the R120^^** sidechain, and 
therefore the binding mode of LPA identifled during docking. Figure 13A shows that CI 90 in EL2 
(corresponding to C36 in the LPAi-CC-EL2 sequence shown in Figure 2) forms a hydrogen bond with 
the R120^^^ sidechain in the hybrid model, tipping the hydrogen bond donors up and away from the 
ligand binding pocket relative to the model shown in Figure 13B. This produces a more hydrophobic 
pocket, reflected in the docked positions of LPA in this model with the hydrophobic tails positioned 

3 28 

near R120 . It is quite possible that consideration of sidechain flexibility during docking may have 
allowed the model shown in Figure 13A to produce docked poses more consistent with mutagenesis 
results. One model was consistent not only with validated LPA interaction sites, but also showed 
consistency with pharmacological trends of five compounds with varied antagonist activity. Future 
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studies will need to address what pattern of disulfide bond linkages occurs among the six cysteine 
residues found in EL2 and ELS of LPAi in order to further refine our understanding of the 
conformations this sequence is able to explore in the context of the flill-length receptor, as the 
ensemble of structures did not show a natural propensity for specific cysteine residues to be in 
proximity to each other. 

Figure 13, Comparison of ligand binding pockets from a model inconsistent with 
mutagenesis (panel A) and from Figure 1 1 (panel B). LPAi is shown as ribbons shading 
irom blue at the amino terminus to red at the carboxy terminus in all panels. R3.28 is 
shown as a spacefilling model, with other residues and LPA shown using stick models in 
all panels. 
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